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Abstract. We propose definitions of QAC°, the quantum analog of the 
classical class AC of constant-depth circuits with AND and OR gates 
of arbitrary fan- in, and QACC°[q], where n-ary MOD q gates are also 
allowed. We show that it is possible to make a 'cat' state on n qubits in 
constant depth if and only if we can construct a parity or MOD2 gate 
in constant depth; therefore, any circuit class that can fan out a qubit 
to n copies in constant depth also includes QACC°[2]. In addition, we 
prove the somewhat surprising result that parity or fanout allows us to 
construct MOD q gates in constant depth for any q, so QACC [2] = 
QACC . Since ACC°[p] / ACC°[q] whenever p and q are mutually 
prime, QACC [2] is strictly more powerful than its classical counterpart, 
as is QAC° when fanout is allowed. 



1 Introduction 

The theory of circuit complexity has become an important branch of theoretical 
computer science. Shallow circuits correspond to parallel algorithms that can 
be performed in small amounts of time on an idealized parallel computer, and 
so circuit complexity can be thought of as a study of how to solve problems 
in parallel. In addition, some low-lying circuit classes have beautiful algebraic 
characterizations, e.g. 

In |ri| , [r^ |, Moore and Nilsson suggested a definition of QNC, the quantum 
analog of NC, the class of problems solvable by circuits with polylogarithmic 
depth and polynomial size. Here, we will study quantum versions of some addi- 
tional circuit classes. Recall the following definitions: 

1. NC fc consists of problems solvable by families of circuits with depth 0(log fc n) 
and size polynomial in n, where n is the size of the input, where the allowed 
gates are ANDs and ORs with two inputs (which may be negated). 

2. AC fc is like NC fc , but where we allow AND and OR gates with unbounded 
fan-in, i.e. arbitrary numbers of inputs, in each layer of the circuit. 

3. ACC k [q] is like AC fc , but where we also allow MOD g gates with unbounded 
fan-in, where MOD g (ii, . . . , x n ) outputs 1 iff the sum of the inputs is not a 
multiple of q. 

4. ACC fc = U q ACC k [q}. 

5. NC = U fe NC fc = U fc AC fc = U fc ACC k . 



Then we have 



AC c ACC°[2] c ACC° C NC 1 C • • • C NC 

In fact, these first two inclusions are known to be proper ||,[^ [[4[j]l| . Neither 
Majority nor Parity are in AC , while the latter is trivially in ACC°[2]. In 
addition, ACC>] and ACC°[g] are known to be incomparable whenever p and 
q are mutually prime. Thus these classes give us some of the few strict inclusions 
known in computational complexity theory. 

There is significant evidence that quantum complexity classes are often more 
powerful than their classical counterparts. The best-known is Shor's factoring 
algorithm |Pij ], showing that integer factoring is in BQP, the quantum analog of 
bounded-probability polynomial time. Other results lower down on the compu- 
tational hierarchy include the recognition of non-regular languages by two-way 
quantum finite automata with bounded probability |Sj] and quantum context-free 
languages that are not classically context-free ppf . 

In this paper, we prove a number of results about QAC and QACC, and ad- 
dress some definitional difficulties. We show that an ability to form a 'cat state' 
with n qubits, or fan out a qubit into n copies in constant depth, is equivalent 
to being able to constrict an n-ary parity gate in constant depth. Furthermore, 
we show the somewhat surprising result that either of these allows us to build 
MOD g gates in constant depth for any q, so QACC [2] = QACC . This is 
markedly different from the classical situation mentioned above. Finally, we dis- 
cuss how best to compare these circuit classes to classical ones, and conclude that 
QACC [2], and a form of QAC we call QAC^, are strictly more powerful 
than ACC°[2] and AC . 



2 Definitions 



We will use the notation in figure [l] for our various gates. The n-ary Toffoli gate, 
or simply the AND gate, negates the target qubit if all the input qubits are true. 
The n-ary controlled-C/ gate applies a unitary transformation U to the target if 
all the inputs are true, and the MOD, gate negates the target if the number of 
true inputs is not a multiple of q. Note that all these are reversible. For three 
inputs, for instance, these matrices can be written 
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(this last is a MOD 2 gate) where 1 = (o^ij and X 
the symmetric phase shift gate, shown here for two qubits 
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Fig. 1. Our notation for n-ary Toffoli, controlled-!/, and MOD 9 gates, fanout 
gates, symmetric phase shift gates, and the Hadamard gate. On the top right, 
we show a useful identity between the controlled- not, the controlled 7r-shift, and 
the Hadamard gate. On the bottom right we show a controlled-!/ gate with one 
of its inputs negated by conjugation with X. 

is a special case of the controlled-C/ gate, and the one-qubit Hadamard gate 
H = -j= ^ . Note that the controlled 7r-shift is simply the controlled-not 

with the target qubit conjugated with H, and that H 2 = 1. Finally, we will 
also sometimes use controlled-!/ gates with some inputs negated, which can be 
performed by conjugating that input with X . All of these are shown in figure [|. 
Then our definition of QAC and QACC is as follows: 

Definition. A family of unitary operators F„ S U(2 n ) is in QAC fc if F n can be 
written as a product of C(log fc n) operators, each of which is a tensor product 
of one-qubit gates and Toffoli gates acting on disjoint sets of qubits. Each such 
tensor product is called a layer. The definition of QACC fc [<7] is similar, except 
that MOD, gates are also allowed. Finally, QACC fc = U q QACC fc [q]. 

We also allow our circuits to use a polynomial number of ancillae or work 
qubits. However, for the final measurement to be correct, these ancillae have to 
be unentangled from the qubits of the final state that we care about. Therefore, 
we require that the ancillae start and end in a pure state |0), in which case 
our desired operator occurs as a diagonal block of the larger operator on this 
subspace Jll]]. This also allows us to re-use ancillae. 

Note that n-ary controlled-!/ gates can be performed in constant depth with 
one ancilla, by writing them as a product of two Toffoli gates and a two-qubit 
controlled-!/ gate with one input, which can then be written as a product of 
controlled-nots and one-qubit gates Thus our definition of QAC remains 
the same if we allow n-ary controlled-!/ gates in each layer. Similarly, allowing 
MOD, controlled-!/ gates, which apply U to the target qubit if the number of 
true inputs is not a multiple of q, does not change our definition of QACC. 

It is, of course, tempting to pronounce these classes 'quack.' 



3 Fanout, Cat states and parity 

To make a shallow parallel circuit, it is often important to fan out one of the 
inputs into multiple copies. One of the differences between classical circuits and 
quantum ones as we have defined them here is that in classical circuits, we usually 
assume that we get arbitrary fanout for free, simply by splitting a wire into as 
many copies as we like. This is difficult in quantum circuits, since making an 
uncntangled copy requires non-unitary, and in fact non-linear, processes since 

HO) + /3|1)) g> (a|0) + (3\1)) = a 2 \00) + a/3(|01) + |10)) + /3 2 |11) 

has coefficients quadratic in a and (3. This is the so-called 'no cloning' theorem. 

However, the controlled-not gate can be used to copy a qubit onto an ancilla 
in the pure state |0) by making a non-destructive measurement: 

{a\0) +j9|l» <8 |0) -> a|00)+/3|ll) 

Note that the final state is not a tensor product of two independent qubits, since 
the two qubits are completely entangled. This means that whatever we do to one 
copy, we do to the other. Except when the states are purely Boolean, we have 
to treat this kind of 'fanout' more gingerly than we would in the classical case. 

By making n copies of a qubit in this sense, we can make a 'cat state' 
a 1 000 • • • 0) + • • • 1). Such states are useful in making quantum computa- 

tion fault-tolerant (e.g. @@). We can do this in log n depth with controlled-not 
gates, as shown in the left-hand part of figure]^. When preceded by a Hadamard 
gate on the top qubit, this will map an initial state |0000) onto a cat state 
^=(|0000) + 1 1 1 1 1 ) ) . However, we will also consider circuits which can do this 
in a single layer, with a 'fanout gate' that simultaneously copies a qubit onto n 
target qubits. This is simply the product of n controlled-not gates, as shown in 
the right-hand part of figure 0. 
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Fig. 2. Two ways to make a cat state on n qubits. The circuit on the left uses 
only two-qubit gates and has depth logn. On the right, we define a 'fanout gate' 
that simultaneously performs n controlled- nots from one input qubit. 



We now show that in quantum circuits, we can do fanout in constant depth 
if and only if we can construct a parity gate in constant depth. 



Proposition 1. In any class of quantum circuits that includes Hadamard and 
controlled-not gates, the following are equivalent: 



1. It is possible to map a\0) + /3\1) and n — 1 ancillae in the state |0) onto an 
n-qubit cat state a|000 • ■ - 0) + • • ■ 1} in constant depth. 

2. The n-ary fanout gate of figure [| can be implemented in constant depth with 
at most n — 1 additional ancillae. 

3. An n-ary parity or MODi gate as defined above can be implemented in con- 
stant depth with at most n — 1 additional ancillae. 



Proof. First, note that (1) is a priori weaker than (2), since (1) only requires 
that an operator map |000 • • • 0) to |000 • • • 0) and 100 • • • 0) to | 111 • • • 1). In fact, 
the two circuits shown in figure ||both do this, even though they differ on other 
initial states. 

To prove (2 3), we simply need to notice that the parity gate is a fanout 
gate going the other way conjugated by a layer of Hadamard gates, since parity is 
simply a product of controlled-nots with the same target qubit, and conjugating 
with H reverses the direction of a controlled-not. This is shown in figure [| 
Clearly the number of ancillae used to perform either gate will be the same. 

To prove (1 => 3), we use a slightly more elaborate circuit shown in figure 
|]. Here we use the identity shown in figure [l] to convert the parity gate into a 
product of controlled 7r-shifts. Since these are diagonal, they can be parallelized 
as in JIl[ by copying the target qubit onto n — 1 ancillae, and applying each one 
to a different copy. While we have drawn the circuit with a fanout gate, any gate 
that satisfies the conditions in (1) will do. 

Finally, (2 =>■ 1) is obvious. □ 




Fig. 3. The parity and fanout gates are conjugates of each other by a layer of 
Hadamard gates. 

This brings up an interesting issue. It is not clear that AC C QAC° as 
we have defined QAC° here, since we are not allowing arbitrary fanout in each 
layer. An alternate definition, which we might call QAC with fanout or QAC wf , 
would allow us to perform controlled-t/ gates or Toffoli gates in the same layer 
whenever they have different target qubits, even if their input qubits overlap. 
This seems reasonable, since these gates commute. Since we can fanout to n 
copies in log n layers as in figure pi we have QAC fe C QAC^ C QAC fe+1 . We 
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Fig. 4. The parity gate can also be written as a product of controlled 7r-shifts, 
with the target qubit conjugated by H. Since these are diagonal, we can paral- 
lelize them using any gate that can make a cat state. 



can define QACC wf in the same way, and Proposition B] implies that QAC(j, f = 
QACCt f [2] = QACC*[2]. 

We leave it to the reader to decide which definition, QAC° or QAC° f , is a 
better analog of AC . 

4 Parity or fanout gives MOD, gates 

As stated above, in the classical case MOD p and MOD g gates are not easy to 
build from each other whenever p and q are relatively prime. In fact, to do it in 
constant depth requires a circuit of exponential size [fl9[ . In this section, we will 
show this is not true in the quantum case. Specifically, with n-ary parity gates 
we can build MOD 9 gates in constant depth. 

Proposition 2. In any circuit class containing n-ary parity gates and one-qubit 
gates, we can construct an n-ary MOD q gate, with n [log q] ancillae, in depth 
depending only on q. 

Proof. Let k — [log q] , and let M be a Boolean matrix on k qubits where the 
zero state has period q. For instance, if we write \x) for < x < 2 k as shorthand 
for \xk-i ■ ■ ■ x±xo) where the Xi are the digits of it's binary expansion, then let 
M permute the \x) as follows: 

M\x) = i\^ + 1)m0dq) 

I \x) it x > q 

Then if we start with k ancillae in the state |0) and apply a controlled-M gate 
to them from each input, the state will differ from |0) on at least one qubit if 
and only if the number of true inputs is not a multiple of q. (Note that this 
controlled-M gate applies to k target qubits at once in an entangled way.) We 
can then apply an n-ary OR to these k qubits, i.e. a Toffoli gate with its inputs 
conjugated with X and its target qubit negated before or after the gate, and 
return the k ancillae to |0) by applying the inverse series of controlled-M''' gates. 



Now we use Proposition 4 of |Tl|] to parallelize this set of controlled-M gates. 
We can convert them to diagonal gates by conjugating the target qubits with a 
unitary operator T, where T* DT = M and D is diagonal. If we have a parity 
gate, we can fan out the k ancillae to n copies each using Proposition |l|. We 
can then simultaneously apply the n controlled- D gates from each input to the 
corresponding copy, and then uncopy them back. 

This is shown in figure |[ For q = 3, for instance, we have M — 
T = 73 e 4 " /3 <A 3 ! , and L> = 1 e ^ i/3 

Ve 2 -/ 3 e 4 -/ 3 1 / V 

The operators T, T>, and the controlled- D gate can be carried out in some 
finite depth by controlled-nots and one-qubit gates by the results of 0]. The 
total depth of our MOD g gate is a function of these and so of q, but not of n. 
Finally, the number of ancillae used is kn = n [log q] as promised. □ 

To look more closely at the depth as a function of q, we note that using the 
methods of Reck et al. [[l5| and Barenco et al. 0], any operator on k qubits can 
be performed with 0(k^4^) two-qubit gates. Since k = |~logg], this means that 
the depths of T, T' and the controlled- D are at most 0(q 2 log 3 q). 

Since we can construct MOD g gates in constant depth, we have QACC fc [g] C 
QACC fc [2] for all q, so QACC fe = QACC fe [2]. By proposition g, these are both 
also equal to QAC(j, f . In particular, we have 

QAC° f = QACC°[2] = QACC 

while classically all of these are strict inclusions. 

To compare these with classical circuits requires a little care. For any Boolean 
circuit with n inputs, m outputs, depth d, and width w, it is easy to construct a 
reversible circuit on n + m + k bits of depth 2d — 1 where n bits keep the input, 
m bits get xor'ed with the output, and k = wd ancillae start and end in the zero 
state. We do this by assigning an ancilla to each gate in the original circuit, and 
replacing each gate with a reversible one that xors that ancilla with the output. 
Then we can erase the ancillae by moving backward through the layers of the 
circuit. 

Then if we adopt the convention that a Boolean function with n inputs and 
m outputs is in a quantum circuit class if its reversible version is, we clearly have 
AC fe C QAC^f and ACC fc C QACC fe . Thus we have 

AC c ACC°[2] c ACC° C QAC° f = QACC [2] = QACC 

showing that QAC° f and QACC [2] are more powerful than AC and ACC°[2] 
respectively. 

Interestingly, if QAC° as we first defined it cannot do fanout, i.e. of QAC° C 
QAC^f , then in a sense it fails to include AC , since the fanout function from 
{0, 1} to {0, 1}™ is trivially in AC . However, it is not clear whether it fails to 
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Fig. 5. Building a MOD g gate. We choose a matrix M on fc = [logg] qubits 
such that M 9 = 1, apply controlled- M gates from the n inputs to k ancillae, 
perform an OR on the target qubit, and reverse the process. To parallelize this, 
we can diagonalize M by writing it as T'DT, fan the k qubits out into n copies 
each using Proposition^, and apply controlled- 1? gates simultaneously from each 
input to a set of copies. The total depth depends on q but not on n. 



include any AC functions with a one-bit output. On the other hand, if QAC 
can do fanout, it can also do parity and is greater than AC , so either way AC 1 
and QAC arc different. I am indebted to Pascal Tesson for pointing this out. 



5 Conclusion 

We have shown that fanout and parity are intimately related in quantum circuits, 
and that either of these makes it possible to construct a MOD g gate in constant 
depth for any q. Thus QAC° f = QACC°[2] = QACC , and so QAC° f and 
QACC°[2] are more powerful than AC and ACC°[2] respectively. This adds 
another case where quantum classes are provably more powerful than their clas- 
sical cousins. 

We suggest three questions for further work: 

1. Is QAC = QAC° f ? That is, can the fanout gate be constructed in constant 
depth when each qubit can only act as an input to one gate in each layer? 

2. Is QACC [9] = QACC for all ql That is, can MOD g gates, where q is 
odd, generate fanout, parity, and MOD p gates for all other pi 

3. Is QAC° f = QTC°? That is, can the techniques used here be extended to 
construct threshold gates in constant depth? 

We conjecture that the answer to all these questions is 'no,' but quantum circuits 
can be surprising... 
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